Baby Talk: Understanding and Generating Image Descriptions

نویسندگان

  • Girish Kulkarni
  • Visruth Premraj
  • Sagnik Dhar
  • Siming Li
  • Yejin Choi
  • Alexander C Berg
  • Tamara L Berg
چکیده

We posit that visually descriptive language offers computer vision researchers both information about the world, and information about how people describe the world. The potential benefit from this source is made more significant due to the enormous amount of language data easily available today. We present a system to automatically generate natural language descriptions from images that exploits both statistics gleaned from parsing large quantities of text data and recognition algorithms from computer vision. The system is very effective at producing relevant sentences for images. It also generates descriptions that are notably more true to the specific image content than previous work.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions

The past few years have witnessed renewed interest in NLP tasks at the interface between vision and language. One intensively-studied problem is that of automatically generating text from images. In this paper, we extend this problem to the more specific domain of face description. Unlike scene descriptions, face descriptions are more fine-grained and rely on attributes extracted from the image...

متن کامل

Labeling Images by Interpretation from Natural Viewing

In this paper, we would like to discuss the connection between visual processing and the understanding of an image. While the information of image viewing can be obtained from subjects’ eye fixation, the understanding of an image can be obtained from the subjects’ description of the given image. Furthermore, we proposed a new image labeling method based on the connection between eye fixation an...

متن کامل

Demystify False Dilemmas to Speak About Corruption in Health Systems: Different Actors, Different Perspectives, Different Strategies; Comment on “We Need to Talk About Corruption in Health Systems”

The call of the editorial of the International Journal of Health Policy and Management regarding the “Need to talk about corruption in health systems” is spot on. However, the perceived difficulties of why this is so should be explored from an actor’s perspective, as they differ for government actors, donors and the research community. In particular, false dilemmas around definition pr...

متن کامل

O-24: Single Oocyte Secretoma Mapping by NMR-Metabolomics Technology: A Non-Invasive Strategy to Select The Best Oocytes to Fertilize Avoiding Supernumerary Embryos and Increasing Take- Home-Baby-Rate after IVF

Background Current strategies based on random selection of MII-oocytes to fertilize appear unsatisfactory in selecting the best number and the most vital oocytes to fertilize especially in poor responder women in which “chronological age” does not mismatch with “biological age”. The metabolomics- profiling approach, evaluating the final products of cell regulatory process (genome/transcriptome/...

متن کامل

Midge: Generating Image Descriptions From Computer Vision Detections

This paper introduces a novel generation system that composes humanlike descriptions of images from computer vision detections. By leveraging syntactically informed word co-occurrence statistics, the generator filters and constrains the noisy detections output from a vision system to generate syntactic trees that detail what the computer vision system sees. Results show that the generation syst...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011